content metadata
RouteNator: A Router-Based Multi-Modal Architecture for Generating Synthetic Training Data for Function Calling LLMs
Belavadi, Vibha, Vatsa, Tushar, Sultania, Dewang, Suresha, Suhas, Verma, Ishita, Chen, Cheng, King, Tracy Holloway, Friedrich, Michael
This paper addresses fine-tuning Large Language Models (LLMs) for function calling tasks when real user interaction data is unavailable. In digital content creation tools, where users express their needs through natural language queries that must be mapped to API calls, the lack of real-world task-specific data and privacy constraints for training on it necessitate synthetic data generation. Existing approaches to synthetic data generation fall short in diversity and complexity, failing to replicate real-world data distributions and leading to suboptimal performance after LLM fine-tuning. We present a novel router-based architecture that leverages domain resources like content metadata and structured knowledge graphs, along with text-to-text and vision-to-text language models to generate high-quality synthetic training data. Our architecture's flexible routing mechanism enables synthetic data generation that matches observed real-world distributions, addressing a fundamental limitation of traditional approaches. Evaluation on a comprehensive set of real user queries demonstrates significant improvements in both function classification accuracy and API parameter selection. Models fine-tuned with our synthetic data consistently outperform traditional approaches, establishing new benchmarks for function calling tasks.
Beyond Labels: Leveraging Deep Learning and LLMs for Content Metadata
Agrawal, Saurabh, Trenkle, John, Kawale, Jaya
Content metadata plays a very important role in movie recommender systems as it provides valuable information about various aspects of a movie such as genre, cast, plot synopsis, box office summary, etc. Analyzing the metadata can help understand the user preferences to generate personalized recommendations and item cold starting. In this talk, we will focus on one particular type of metadata - \textit{genre} labels. Genre labels associated with a movie or a TV series help categorize a collection of titles into different themes and correspondingly setting up the audience expectation. We present some of the challenges associated with using genre label information and propose a new way of examining the genre information that we call as the \textit{Genre Spectrum}. The Genre Spectrum helps capture the various nuanced genres in a title and our offline and online experiments corroborate the effectiveness of the approach. Furthermore, we also talk about applications of LLMs in augmenting content metadata which could eventually be used to achieve effective organization of recommendations in user's 2-D home-grid.
Content metadata: why keyword extraction requires automated labelling -- EDIA
Keywords are no science but an art. There is no such thing as'the right keyword,' as we're talking about a core concept incorporated into a piece of content in the broadest form. Texts don't necessarily need to contain an exact keyword. For example, if the term'European Union' is used several times, 'European Commission' may be a suitable keyword even though the writer never uses the term. Despite this fluid definition, keywords should be understandable to those who try to find the right ones.
How Machine Learning Is Changing the Game for Content Metadata
These are the best of times for entertainment content owners and distributors--but they are also very challenging times. There is more content--often great content--than ever before and also vastly more competition due to the rise of streaming services, as well as on-demand options. This presents a challenge for content owners and distributors: how to stand out from the crowd and help viewers find what they want. Awash in all that content--not just professionally produced long-form content, but also highly viral digital-first content--viewers have a hard time wading through it all. In fact, it would take a single viewer more than 5 million years to watch the amount of video that crosses global IP networks each month, according to a recent Cisco Systems report. That's why it's imperative for content owners and distributors to make it easy for viewers to search and discover their content.